ABSTRACT

We present a new method of estimating the distribution of sales rates of, e.g., book titles at 
an online bookstore, from the time evolution of ranking data found at websites of the store. The 
method is based on new mathematical results on an infinite particle limit of the stochastic ranking 
process, and is suitable for quantitative studies of the long tail structure of online retail. We give
an example of a fit to actual data obtained from Amazon.co.jp, which gives the Pareto slope
parameter of the distribution of sales rates of the book titles in the store. 

1 Introduction.

Internet commerce has drastically increased product variety through low search and transaction 
costs and nearly unlimited inventory capacity. With this new possibility, a theory [Anderson, 2006]
has been advocated which claims that a huge number of poorly selling products (long tail products)
that are now available on internet catalogs could make a significant contribution to the total sales.
In this paper, we refer to this theory as the possibility of long tail business.

In studying the possibilities of long tail business, we need a precise, quick, and costless
quantitative method of analyzing the long tail structure, but there we encounter a problem. For
example, online bookstores have millions of books on their electronic catalogues, but many of the
books have average quarterly sales of less than 1. This means that if we start collecting the sales
record, we will end up, after waiting for 3 months, with a list which has ten thousand lines with
0 sales and another ten thousand with 1 sale, and so on. Moreover, the result will not mean that a
particular book with 1 sale has a better potential sales ability than a book with 0 sales: a problem
characteristic of quantitative analysis of long tail business is that, for product items of low
sales potential, fluctuations dominate in the observed data. Even though we want to suppress
fluctuations, since each item produces very little profit, we cannot afford to spend the time and
money to collect extensive data over the long period required by the law of large numbers.

If we hope to estimate the total sales of a store, we could obtain it from an observation in 
a short period with less relative fluctuations, thanks to the law of large numbers. For a revenue 
officer, this may be sufficient. But for those who are interested in the long tail business, for
example, an executive running the online store or a stockholder waiting for disclosure, as well as
an observer for research purposes, the detailed structure of the contribution of poorly selling
items would be important. More specifically, we would like to know the distribution of sales
potentials of the products at an online store, such as the ratio of the number of items with
average sales rate below any given number. As discussed in the previous paragraph, extracting the
average sales rate of an item would require a long time of observation. One would then consider
observing sufficiently many items of relatively low sales and calculating an average, to suppress
statistical fluctuation, but then one faces the problem of selecting product items of similar sales
potential, and we come back to the problem of statistical fluctuation for the data on a single item
in the long tail regime.

On web pages, various ranking data can be found. An example is the sales rankings of books 
at online bookstores such as Amazon.com. On the web page of each book, we see, as well as 
the title, price, and description of the book, a number ranging from 1 to several millions which 
indicates the book's relative sales ranking at the online store. In this paper, based on the analysis 
of a mathematical model defined and studied in [K.&T. Hattori, 2008a, K.&T. Hattori, 2008b], we
propose a new and simple method, using the ranking data, to overcome the problem of statistical
fluctuations of the data on items with low sales potential. Our method allows us, by observing
how the sales ranking of a single product develops with time, to reproduce the distribution of sales
potentials of all the products sold at the online store, free of statistical fluctuations. Our theory
could serve as an efficient and inexpensive method for prompt analysis of long tail sales structure.

The plan of the paper is as follows. In Section 2 we review the model of the stochastic ranking
process, and explain the main theorems in [K.&T. Hattori, 2008a, K.&T. Hattori, 2008b]. To test
the applicability of our theory in practical situations, we apply in Section 3 the formulas summarized
in Section 2 to the rankings at Amazon.co.jp. In Section 4 we discuss further implications of
the theory of the stochastic ranking process and possible implications of the results obtained at
Amazon.co.jp.

2 Formulation.

In this section, we summarize the main results in [K.&T. Hattori, 2008a] on the stochastic ranking
process. It is a simple model that describes the time development of sales rankings at online 
bookstores. 

Consider a system of N items (say, book titles), each of which has a ranking ranging from 1 
to N so that no two items have the same ranking. Each item sells at random times. Every time 
(a copy of) an item sells, the item jumps to rank 1 immediately. If its ranking was $m$ before the
sale, all the items that had rank 1 through $m - 1$ just before the sale shift to rank 2 through $m$,
respectively. Thus, the motion of an item's ranking consists of jumps to the top and a monotonic
increase in the ranking number between its own sales, caused by the sales of numerous other items.

We prove that under appropriate assumptions, in the limit $N \to \infty$, the random motion of each
item's ranking between sales converges to a deterministic trajectory. This trajectory can actually
be observed as the time development of a book's sales ranking at Amazon.co.jp's website. Simple
as our model is, its prediction fits well with observation and allows the estimation of the Pareto
slope parameter. We also prove that the (random) empirical distribution of this system (sales rates
and scaled rankings) converges to a deterministic time-dependent distribution.

To formulate the model mathematically, let us introduce notations and state assumptions. Let
$i = 1, \dots, N$ be the labels that distinguish the items. We denote the sales ranking of item $i$
at time $t$ by $X_i^{(N)}(t)$, for $i = 1, 2, \dots, N$. Assume that a set of initial rankings
$x_{i,0}^{(N)} = X_i^{(N)}(0)$, satisfying $x_{i,0}^{(N)} \ne x_{i',0}^{(N)}$ for $i \ne i'$, and
sales rates $w_i^{(N)} > 0$ are given (non-random). Namely, items with various sales rates (selling
well or poorly) start with these given initial rankings $x_{i,0}^{(N)}$, and are set in motion
according to their sales rates. Let $\tau_{i,0}^{(N)} = 0$ and let $\tau_{i,j}^{(N)}$,
$i = 1, \dots, N$, $j = 1, 2, \dots$, be the $j$-th sales time of item $i$, which is a random
variable. Assume that sales of different items occur independently, and furthermore, for each $i$,
the time intervals between sales $\{\tau_{i,j+1}^{(N)} - \tau_{i,j}^{(N)}\}_{j = 0, 1, 2, \dots}$
are independent and have an identical exponential distribution to that of $\tau_{i,1}^{(N)}$, given by

$$P[\tau_{i,1}^{(N)} \le t] = 1 - e^{-w_i^{(N)} t}, \quad t \ge 0.$$

A property of exponential distributions implies that $w_i^{(N)}$ corresponds to the average number
of sales per unit time. In the time interval $(\tau_{i,j}^{(N)}, \tau_{i,j+1}^{(N)})$ the ranking
$X_i^{(N)}(t)$ increases by 1 every time one of the other items on the tail side of the sales
ranking (i.e., with larger $X_{i'}^{(N)}(t)$) sells. Thus, the stochastic ranking process is defined
as follows: for $i = 1, \dots, N$,

(i) $X_i^{(N)}(0) = x_{i,0}^{(N)}$,

(ii) $X_i^{(N)}(\tau_{i,j}^{(N)}) = 1$, $j = 1, 2, \dots$,

(iii) for each $i' \ne i$ and $j' = 1, 2, \dots$, if
$X_i^{(N)}(\tau_{i',j'}^{(N)} - 0) < X_{i'}^{(N)}(\tau_{i',j'}^{(N)} - 0)$ then
$X_i^{(N)}(\tau_{i',j'}^{(N)}) = X_i^{(N)}(\tau_{i',j'}^{(N)} - 0) + 1$, where
$\tau_{i',j'}^{(N)} - 0$ means 'just before' time $\tau_{i',j'}^{(N)}$,

(iv) otherwise $X_i^{(N)}(t)$ is constant in $t$. ◇

Since sales rankings are determined by random sales times, sales rankings are also random variables.
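
The definition above can be illustrated by a small simulation. The following is a minimal sketch, not the authors' code: it assumes that the independent exponential clocks may be simulated jointly (the next sale of any item occurs after an exponential time with the total rate, and the selling item is chosen with probability proportional to its rate), and the function name `simulate_ranking` and its arguments are hypothetical.

```python
import itertools
import random


def simulate_ranking(rates, t_end, seed=0):
    """Simulate the stochastic ranking process on [0, t_end].

    rates[i] is the sales rate of item i; item i starts at rank i + 1.
    Returns (order, sold): order[r] is the item at rank r + 1 at time t_end,
    and sold is the set of items that sold at least once.
    """
    rng = random.Random(seed)
    n = len(rates)
    order = list(range(n))                    # order[r] = item at rank r + 1
    cum = list(itertools.accumulate(rates))   # cumulative rates for sampling
    total = cum[-1]
    sold = set()
    t = rng.expovariate(total)                # time of the first sale of any item
    while t <= t_end:
        # the selling item is chosen with probability rate / total
        item = rng.choices(range(n), cum_weights=cum)[0]
        order.remove(item)                    # items ahead of it shift down by one
        order.insert(0, item)                 # the sold item jumps to rank 1
        sold.add(item)
        t += rng.expovariate(total)
    return order, sold
```

In such a run the items that have sold occupy the top ranks, and the items that have not sold keep their initial relative order below them.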

Let $x_C^{(N)}(t) = \#\{i \mid \tau_{i,1}^{(N)} \le t\}$, where $\#A$ denotes the number of
elements of a set $A$. $x_C^{(N)}(t)$ is the number of items which have sold at least once by time
$t$. Note that in the ranking queue of items, the item with rank $x_C^{(N)}(t)$ marks a boundary;
all the items with $X_i^{(N)}(t) \le x_C^{(N)}(t)$ ('higher' rankings) have experienced a sale,
while those with $X_i^{(N)}(t) > x_C^{(N)}(t)$ ('lower' rankings) have not sold at all by time $t$.

We can also see $x_C^{(N)}(t) + 1$, $0 \le t \le T$, as the trajectory of the sales ranking of an
item that started with rank 1 at time 0 and has not sold by time $T$. It is convenient to consider
the scaled trajectory defined by $y_C^{(N)}(t) = \frac{1}{N} x_C^{(N)}(t)$, for it is confined in
the finite interval $[0, 1]$. The scaled trajectory is random, but the following proposition shows
that this random trajectory converges to a deterministic (non-random) one as $N \to \infty$.

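Recall that item $i$ has sales rate $w_i^{(N)}$. This determines the empirical distribution of
sales rate as

$$\lambda^{(N)}(dw) = \frac{1}{N} \sum_{i=1}^{N} \delta_{w_i^{(N)}}(dw),$$

where $\delta_c$ with $c \in \mathbb{R}$ denotes a unit distribution concentrated at $c$.
Namely, for any set $A \subset [0, \infty)$,

$$\int_A \delta_c(dw) = \begin{cases} 1, & \text{if } c \in A, \\ 0, & \text{if } c \notin A. \end{cases}$$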



Proposition 1 Assume that the empirical distribution of sales rate $\lambda^{(N)}$ converges as
$N \to \infty$ weakly to a distribution $\lambda$. Then

$$y_C^{(N)}(t) \to y_C(t) \tag{1}$$

in probability, where

$$y_C(t) = 1 - \int_0^\infty e^{-wt}\,\lambda(dw). \tag{2}$$

◇

This proposition is a straightforward result of the law of large numbers. Intuitively, the stochastic
process $y_C^{(N)}$ converges to the deterministic curve $y_C$ because the trajectory of an item
between its points of sale is determined by the independent sales of numerous other items (those on
the tail side of the observed book in the ranking). The popularity of the observed book is reflected
in the length of its sojourn in the sequence before it makes its next jump (i.e., is ordered for sale).

Remarks. (i) The random variable $y_C^{(N)}(t)$ converges as $N \to \infty$ to a deterministic
quantity $y_C(t)$. This implies that if $N$ is large enough, the scaled trajectory provides us with
fluctuation-free information. If we try to know the sales rate of each product by counting the
sales for a certain period of time, we cannot avoid fluctuation. The more precise the data we
want, the more time is needed to count the sales, especially for items that rarely sell, say, once
a month. This proposition ensures that by observing the time development of the sales ranking of a
single item, we can reproduce the distribution of sales rates, free of statistical fluctuation.

(ii) $L(t) = \int_0^\infty e^{-wt}\,\lambda(dw)$ on the right-hand side of (2) is the Laplace
transform of the distribution $\lambda$. There is a uniqueness theorem according to which the
Laplace transform completely determines the distribution [Billingsley, 1995]. ◇
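
Proposition 1 can be checked numerically for a finite number of items. The sketch below is an illustration under stated assumptions, not part of the original analysis: the function names are hypothetical, the first sale time of each item is sampled directly from its exponential distribution, and the limit curve (2) is evaluated with the empirical distribution of the given rates in place of the limit distribution.

```python
import math
import random


def scaled_boundary(rates, t, seed=0):
    """Fraction of items whose first sale time is at most t (one random sample)."""
    rng = random.Random(seed)
    sold = sum(1 for w in rates if rng.expovariate(w) <= t)
    return sold / len(rates)


def limit_curve(rates, t):
    """Right-hand side of (2) with the empirical distribution of the rates."""
    return 1.0 - sum(math.exp(-w * t) for w in rates) / len(rates)
```

For a few tens of thousands of items the two quantities are expected to differ by an amount of order one over the square root of the number of items.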



Intuitively, we can guess that near the top of the ranking, there are more items with large sales 
rates than in the tail regime. This intuition can be made mathematically precise and rigorous: 



Theorem 2 Assume the following:

(1) The combined empirical distribution of sales rates and the initial scaled sales rankings
$y_{i,0}^{(N)} = \frac{1}{N}(x_{i,0}^{(N)} - 1)$,

$$\mu_{y,0}^{(N)}(dw)\,dy = \frac{1}{N} \sum_i \delta_{w_i^{(N)}}(dw)\,\delta_{y_{i,0}^{(N)}}(dy),$$

converges as $N \to \infty$ to a distribution $\mu_{y,0}(dw)\,dy$ on $\mathbb{R}_+ \times [0, 1]$
which is absolutely continuous with regard to the Lebesgue measure on $[0, 1]$.

(2) $\lambda(\{0\}) = 0$.

(3) $\int_0^\infty w\,\lambda(dw) < \infty$.

Then the combined empirical distribution of sales rates and scaled rankings
$Y_i^{(N)}(t) = \frac{1}{N}(X_i^{(N)}(t) - 1)$,

$$\mu_{y,t}^{(N)}(dw)\,dy = \frac{1}{N} \sum_i \delta_{w_i^{(N)}}(dw)\,\delta_{Y_i^{(N)}(t)}(dy),$$

converges as $N \to \infty$ to a distribution $\mu_{y,t}(dw)\,dy$ which is absolutely continuous
with regard to the Lebesgue measure on $[0, 1]$.

In particular, the ratio of items with $0 < a \le w \le b$ and rankings in $[0, y] \subset [0, 1)$
at time $t$ is given by

$$\int_0^y \mu_{z,t}([a,b])\,dz =
\begin{cases}
\displaystyle \int_a^b \left(1 - e^{-w t_0(y)}\right) \lambda(dw), & y < y_C(t), \\[2ex]
\displaystyle \int_a^b \left(1 - e^{-wt}\right) \lambda(dw)
  + \int_a^b e^{-wt} \int_0^{\hat{y}(y,t)} \mu_{z,0}(dw)\,dz, & y > y_C(t),
\end{cases} \tag{3}$$

where $t_0(y)$ is the inverse function of the strictly increasing continuous function $y_C(t)$:

$$y_C(t_0(y)) = y, \quad 0 \le y < 1, \tag{4}$$

and $\hat{y}(\cdot, t)$ is the inverse function of

$$y_C(y, t) = 1 - \int_y^1 \int_0^\infty e^{-wt}\,\mu_{z,0}(dw)\,dz,$$

which is a strictly increasing continuous function of $y$.

Furthermore, the trajectory $\frac{1}{N} X_i^{(N)}(\tau_{i,j} + t)$, time-shifted by $\tau_{i,j}$,
converges as $N \to \infty$ to $y_C(t)$ given in Proposition 1 up to the next jump time
($t \le \tau_{i,j+1} - \tau_{i,j}$). ◇

Remarks. (i) Assumption (1) says that in actual applications we are considering a long tail
economy with a large number of items $N \gg 1$, and that we may regard the empirical distribution
$\mu_{y,0}^{(N)}$ at the starting point of observation as a continuous distribution.

(ii) Assumption (2) implies that all the items sell. With extra notations Theorem 2 essentially
holds without Assumption (2), but we will keep it to avoid complications.

This assumption implies that $y_C$ is a strictly increasing function of $t$, and the inverse function
$t_0 : [0, 1) \to [0, \infty)$ exists. Under Assumption (2), $y_C(y, t)$ is a strictly increasing
function of $y$, thus the inverse $\hat{y}(\cdot, t) : [y_C(t), 1) \to [0, 1)$ exists.

(iii) Assumption (3) assures that the explicit form of the limit (3) in the Theorem holds also
for $y = 0$. For $y > 0$ the Theorem holds without Assumption (3). (Hence the only essential
assumption is Assumption (1).)

(iv) The last statement in the Theorem implies that by observing the time development of the
ranking $x_C(t)$ of any single item from the moment of its point of sale ($x_C(0) = 0$), we can,
by equating $y_C(t) = x_C(t)/N$ with (2), obtain the information on the distribution of sales
potentials $\{w_i\}$ of all the items listed in the rankings.

(v) This Theorem is mathematically nontrivial in the sense that a law of large numbers for
'dependent' random variables is the key to the proof.

It is also known that $\mu_{y,t}(dw)$ satisfies the following set of partial differential
equations: For any measurable set $A \subset [0, \infty)$,

$$\frac{\partial \mu_{y,t}(A)}{\partial t} + \frac{\partial \left(v(y,t)\,\mu_{y,t}(A)\right)}{\partial y}
  = -\int_A w\,\mu_{y,t}(dw), \qquad
\frac{\partial v}{\partial y}(y, t) = -\int_0^\infty w\,\mu_{y,t}(dw).$$

For mathematical details, see [K.&T. Hattori, 2008a, K.&T. Hattori, 2008b]. ◇



In the subsequent sections we consider the stochastic ranking process as a model for the rankings 
found, for example, at the web sites of an online bookstore. We regard an item in the model as a 
book title, and the jump time to rank 1 as the time that the title is ordered for sale. According 
to the definition of the model, we assume that each time a book is ordered the ranking of the title 
jumps to 1, no matter how unpopular the book may be. At first thought one might guess that such 
a naive ranking will not be a good index for the popularity of books. But thinking more carefully, 
one notices that well sold books (items with large $w_i$, in the model) are dominant near the head
of the ranking, while books near the tail are rarely sold. Hence, though the ranking of each book
is stochastic and has sudden jumps, the spatial distribution of jump rates is more stable, with
the ratio of books with large jump rate high near the top position and low near the tail position.
Seen from the bookstore's side, it is not a specific book that really matters, but the totality of
book sales that counts, so the evolution of the distribution of jump rates is important. Theorem 2
says that we can make this intuition rigorous and precise, with an explicit form of the distribution
when the total number of titles in the catalog of the bookstore is large (i.e., in the large $N$ limit).



3 Application to sales analysis of Amazon.co.jp. 

In this section, we give an explicit example of how the theoretical framework in Section 2 could
be applied to realistic situations. We will focus on the sales ranking data found at the websites of
Amazon.co.jp, the Japanese counterpart of the online bookstore Amazon.com.

We first give in Section 3.1 a brief explanation of the sales ranking number found on the web
pages for Japanese books at Amazon.co.jp, and summarize in Section 3.2 the method of applying
Section 2 to actual ranking data, and give an explicit result of statistical fits of the distribution of
sales rates of the books at the online bookstore.



3.1 Amazon.co.jp book sales ranking. 

The web sites of Amazon (irrespective of countries) have a web page for each book title, where 
we find, as well as its title, author and price, a number which represents the sales ranking of the 
book. It has been noticed [Chevalier et al., 2003, Brynjolfsson et al., 2003] that this number serves
as important data for quantitative studies of the economic impact of online bookstores. This is
because the number reflects the sales rate of the book, and especially in the situation that, in the
words of [Brynjolfsson et al., 2003], 'Internet retailers are extremely hesitant about releasing
specific sales data', it can be one of the scant data publicly available.

We refer to [Chevalier et al., 2003] for the general structure of the web pages, and to
[Rosenthal, 2006] for a summary based on an apparently long and extensive observation of the
ranking number at Amazon.com, and in particular, a discussion of its relation to the actual sales
of the book at Amazon.com. Here we focus on observed facts about the time evolution of ranking
numbers at Amazon.co.jp, for two reasons. Firstly, it is said that Amazon.com adopts a more
involved definition of the ranking numbers than the stochastic ranking process. Secondly, it is
easier for the authors to find appropriate data at Amazon.co.jp (Japan is our home country).

If we keep observing the ranking number of a book, we soon notice that it is updated once per 
hour regularly. For a relatively unpopular book title, the corresponding ranking number increases 
steadily and smoothly for much of the time as the number is updated, but once in a while we see 
a sudden jump to a smaller number around ten thousand. This happens when a copy of the book
is ordered for purchase, which can be checked by personally ordering a copy at the Amazon website;
at the update time which is 1 to 2 hours after the order, the ranking number is observed to jump.
Actually, except for the top ten thousand sellers out of a few million Japanese book titles catalogued
at Amazon.co.jp, a book sells fewer than one copy per hour on average, hence the qualitative motion
just described holds for 99 percent of the book titles at Amazon.co.jp.

Note that this behavior of the time evolution of a ranking number is similar to that of the
stochastic ranking model in Section 2. The correspondence is also natural from an observation by
[Rosenthal, 2006] that Amazon's ranking number system 'is based almost entirely on "what
have you done for me lately"'. For seldom sold books, any natural definition of the ranking number
satisfying such a criterion would be in the order of latest sales time, because any sales record before
the latest one would lie in the more remote past and would have only a small effect on any
reasonable definition of the ranking number. Hence the definition of the stochastic ranking process
in Section 2, even though it may have sounded over-simplified, has a chance of being a good
theoretical basis for modelling the ranking numbers on the web, especially for probing a large
collection of titles in the long tail regime of the catalog, which is of interest in this paper.

If we further assume as usual that the points of sale are random, then we will have a full
correspondence between the stochastic ranking model and the time evolutions of ranking numbers
at Amazon.co.jp. Based on the correspondence, we give, in Section 3.2, explicit
formulas which relate the time evolution of a ranking number $x_C(t)$ to the distribution of average
sales rates of the book titles at the bookstore, and then using the formulas we give results of fits
to observed data.



3.2 Stochastic ranking process analysis of book sales ranking. 



We start with a standard assumption, as, for example, in [Chevalier et al., 2003,
Brynjolfsson et al., 2003], that the probability distribution of book sales rate is a Pareto
distribution (also called a power law or a log-linear distribution). In the notations of Section 2
this means that we assume the probability measure $\lambda$ to be

$$\lambda([w, \infty)) = \begin{cases} \left(\dfrac{a}{w}\right)^{b}, & w \ge a, \\[1.5ex] 1, & w < a, \end{cases} \tag{5}$$

where $a$ and $b$ are positive constants. Its probability density function is given by

$$\frac{d\lambda}{dw}(w) = \begin{cases} \dfrac{b a^b}{w^{b+1}}, & w \ge a, \\[1.5ex] 0, & w < a. \end{cases} \tag{6}$$

In terms of books, $w$ denotes the average sales rate of a book on the list of a bookstore; a book
with rate $w$ sells, on average in the long run, $w$ copies per unit time. $\lambda$ is the
distribution of $w$; for example, $\lambda([w, \infty))$ is the ratio of the number of book titles
with sales rate $w$ or more to the total number of titles. Alternatively we could start with another
(discrete) formulation of the Pareto distribution,

$$w_i = a \left(\frac{N}{i}\right)^{1/b}, \quad i = 1, 2, 3, \dots, N, \tag{7}$$

where the constant $a$ in (7) (or in (5)) denotes the lowest positive sales rate among the book
titles at the store. Note that the books that never sell should be omitted in applying our theory.
$N$ is the total number of titles catalogued at the online bookstore that actually sell, and $w_i$
is the average sales rate of the $i$-th best seller. The ratio of titles with average sales rate
$w$ or more is then

$$\frac{1}{N}\,\#\{i \mid w_i \ge w\} = \frac{1}{N}\,\#\left\{i \,\Big|\, i \le N \left(\frac{a}{w}\right)^{b}\right\}$$

for $w \ge a$, reproducing (5).
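
The agreement between the discrete formulation (7) and the tail ratio (5) can be checked directly. The following is a minimal sketch with hypothetical function names and illustrative parameter values chosen only for the example.

```python
def pareto_rates(n, a, b):
    """Sales rates w_i = a * (n / i) ** (1 / b), i = 1..n, as in (7)."""
    return [a * (n / i) ** (1.0 / b) for i in range(1, n + 1)]


def tail_ratio(rates, w):
    """Fraction of titles whose sales rate is at least w."""
    return sum(1 for r in rates if r >= w) / len(rates)


# Illustrative values (assumptions, not fitted parameters).
n, a, b = 10000, 1e-3, 0.8
rates = pareto_rates(n, a, b)
empirical = tail_ratio(rates, 1e-2)     # counted from the discrete rates (7)
continuous = (a / 1e-2) ** b            # tail ratio predicted by (5)
```

The two values agree up to the rounding of the count to an integer, that is, up to an error of order 1/n.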

The exponent $b$ ($-1/b$ corresponds to the Pareto slope parameter) is crucial in the analysis
of the economic impact of the retail business in question. In fact, previous studies using the
ranking numbers at online bookstores [Chevalier et al., 2003, Brynjolfsson et al., 2003] use the
data for extracting the exponent $b$, which was then used to study various aspects of the economic
impact of online bookstores. An intuitive meaning of the exponent $b$ can be seen, for example, by
taking the ratio of (7) for $i = 1$ and $i = N$, to find

$$\frac{w_1}{w_N} = N^{1/b}, \tag{8}$$

which roughly says that for large $N$, if $b$ is small then $w_1$ is very large compared to $w_N$,
so that the greatest hits dominate the sales, while if $b$ is large the contributions are more equal,
and since there are many unpopular titles, their total contribution to the sales may dominate (the
'long tail' possibility). We will discuss the implications of the parameter $b$ further in Section 4.
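
The qualitative statement above can be made concrete by computing, under the discrete formulation (7), the share of the total sales rate contributed by the best-selling titles. This is an illustrative sketch only: the function name `head_share`, the catalogue size, and the 1% cutoff are assumptions made for the example.

```python
def head_share(n, b, top_fraction=0.01):
    """Share of the total sales rate due to the top `top_fraction` of titles.

    Rates follow (7); the constant a cancels in the ratio, so it is omitted.
    """
    weights = [(n / i) ** (1.0 / b) for i in range(1, n + 1)]
    k = max(1, int(n * top_fraction))   # number of titles counted as the head
    return sum(weights[:k]) / sum(weights)


share_small_b = head_share(100000, 0.6)   # small b: hits are expected to dominate
share_large_b = head_share(100000, 1.5)   # large b: contributions are more equal
```

Comparing the two shares shows how the weight moves from the head of the ranking toward the tail as $b$ increases.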

Our method of obtaining the parameters $a$ and $b$ is to observe the time development of the
ranking of any single book title, which contains information on $\lambda$, with statistical
fluctuations strongly suppressed. (One may be curious why data from a single title could have
fluctuations suppressed. This is because the time development of the ranking, while the book in
question is not sold, is a result of the total sales of the large number of titles on the tail side
of the observed book in the catalog of an online bookstore, hence the statistical fluctuation is
suppressed by a law-of-large-numbers mechanism. This is a practical meaning of the deterministic
motion appearing as an infinite particle limit stated in Section 2.) Substituting (5) in (2) we have

$$y_C(t) = 1 - b a^b \int_a^\infty e^{-wt}\,w^{-b-1}\,dw = 1 - b\,(at)^b\,\Gamma(-b, at), \tag{9}$$

where $\Gamma$ is the incomplete Gamma function defined by
$\Gamma(z, p) = \int_p^\infty e^{-x} x^{z-1}\,dx$. Since $b$ is positive,
$\Gamma(-b, at) \to \infty$ as $t \to 0$. This divergence is mathematically harmless because of the
factor $t^b$, but from a practical point of view, it is convenient to use the integration-by-parts
formula

$$\Gamma(z, p) = -z^{-1} p^z e^{-p} + z^{-1}\,\Gamma(z + 1, p) \tag{10}$$

to obtain

$$y_C(t) = 1 - e^{-at} + (at)^b\,\Gamma(1 - b, at). \tag{11}$$

This formula is satisfactory for $0 < b < 1$. For $1 < b < 2$ use (10) again to obtain

$$y_C(t) = 1 - \left(1 - \frac{at}{b - 1}\right) e^{-at} - \frac{(at)^b}{b - 1}\,\Gamma(2 - b, at). \tag{12}$$

In principle, we may perform integration by parts as many times as required, though we did not
come across values $b \ge 2$ in the literature or in our data. For $b = 1$, we need a slightly
different formula with 'logarithmic corrections', but we have not observed any practical evidence
that the exact value of $b = 1$ occurs, so we will always assume $b \ne 1$ in the following, to
simplify the formulas.
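
Formulas (11) and (12) are straightforward to evaluate numerically. The sketch below is one possible implementation using only the Python standard library, not the authors' code: the names `upper_gamma` and `y_c` are hypothetical, and the incomplete Gamma function is computed from a power series for the lower incomplete Gamma function, which is an assumption suitable for moderate values of $at$ (say, below about 10).

```python
import math


def upper_gamma(s, x, terms=200):
    """Upper incomplete Gamma function Gamma(s, x) for s > 0 and moderate x >= 0.

    Uses Gamma(s, x) = Gamma(s) - gamma(s, x), with the power series
    gamma(s, x) = x**s * sum_k (-x)**k / (k! * (s + k)).
    """
    total, term = 0.0, 1.0               # term holds (-x)**k / k!
    for k in range(terms):
        total += term / (s + k)
        term *= -x / (k + 1)
    return math.gamma(s) - x ** s * total


def y_c(t, a, b):
    """Scaled ranking y_C(t) from (11) for 0 < b < 1 and from (12) for 1 < b < 2."""
    p = a * t
    if 0.0 < b < 1.0:
        return 1.0 - math.exp(-p) + p ** b * upper_gamma(1.0 - b, p)
    if 1.0 < b < 2.0:
        return (1.0 - (1.0 - p / (b - 1.0)) * math.exp(-p)
                - p ** b / (b - 1.0) * upper_gamma(2.0 - b, p))
    raise ValueError("this sketch covers 0 < b < 1 and 1 < b < 2 only")
```

For $b = 1/2$ the incomplete Gamma function reduces to the complementary error function, which gives an independent check of (11).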

Note in particular that (11) implies that for $b < 1$ we have a concave time dependence for short
times,

$$y_C(t) = (at)^b\,\Gamma(1 - b, 0) + o(t^b),$$

while (12) implies that for $b > 1$ we have a linear short-time dependence. According to the results
in Section 2, $y_C(t)$ is the relative position (i.e., $0 \le y_C(t) < 1$) at time $t$ in the ranking
of the title which was at the top position (i.e., sold) at $t = 0$. The corresponding ranking number
$x_C(t)$ is given by

$$x_C(t) \approx N y_C(t) = N \left(1 - e^{-at} + (at)^b\,\Gamma(1 - b, at)\right), \tag{13}$$

where $N$ is the total number of catalogued titles that actually sell. We cannot control the
sub-leading order in $N$ because of the statistical fluctuations. (The limit theorems in Section 2
assure that the leading order is free of statistical fluctuations.) However, since Amazon has a huge
'electronic bookshelf' of order $N = O(10^6)$, we will neglect the statistical fluctuations of
relative order $O(1/\sqrt{N}) = O(10^{-3})$.

Incidentally, we can alternatively start from (7) and use the empirical distribution
$\frac{1}{N} \sum_{i=1}^{N} \delta_{w_i}$ for $\lambda$, where $\delta_w$ is a unit distribution
concentrated at $w$. Then from (2) we have, by elementary calculus,

$$y_C(t) = 1 - \frac{1}{N} \sum_{i=1}^{N} e^{-a (N/i)^{1/b}\,t}
        = 1 - b a^b \int_a^\infty e^{-wt}\,w^{-b-1}\,dw + O(N^{-1}),$$

reproducing (9).

Before closing this subsection, we recall that (2) implies that the ranking of an item is, as a
function of time $t$, essentially the Laplace transform of the underlying distribution $\lambda$ of
the jump (sales) rates. If we have accurate and long enough ranking data (i.e., observation of the
time evolution of the ranking $x_C(t)$ for a very long period and with very fine intervals), the
uniqueness of the inverse Laplace transform assures in principle the determination of $\lambda$
non-parametrically, i.e., without assumptions on $\lambda$ such as assuming the Pareto distribution
(5). This approach however requires very fine data, because the Laplace transform has a smoothing
effect through the $e^{-wt}$ factor, and small irregular differences in the Laplace transform could
result in a large difference in the original function. In the case of Amazon.co.jp, which we see in
Section 3.3, the ranking is updated only once per hour and we cannot expect fine enough data (as is
also the case for Amazon.com), so we will follow a standard approach assuming a Pareto distribution
for $\lambda$. (Needless to say, the managers in the Amazon company have access to precise real-time
data, hence our methods will help them analyze and plan the inventory controls and evaluate the sales.)

If the long tail economy expands in the future, and our methods turn out to be of practical use, it
would be preferable to have real-time updates of the ranking data, which will make our
methods more efficient and accurate. (It will not cost any more than Amazon's current ranking
data updates with hourly intervals; in fact, the title listings at 2ch.net adopt such algorithms
[K.&T. Hattori, 2008b].)

3.3 Results from Amazon.co.jp.

By performing a statistical fit of ranking time evolution data to (13), we can in principle obtain
the parameters a and b which determine the distribution of average sales rates of the book titles at 
Amazon.co.jp. In the practical situations, it turns out that the total number N of the book titles 
also needs to be determined from the data. 

We are aware that Amazon.co.jp publicizes at their website the total number of book titles in
their catalog, which can be reached by making an unconditioned search at the Amazon website.
However, the book catalogs at Amazon websites contain books which are not available and therefore
do not sell, and hence, as we noted below equation (7) while describing the Pareto distribution,
should be discarded from our analysis. We have experienced more than once that we order a book at
the website and receive a note after a while that the book has not been found and that the order is
cancelled. At the same time, we observe the ranking number of that cancelled title making jumps
to the tail side. We thus realize that the claimed number of titles at the website contains those
with $w = 0$ and is therefore strictly larger than what we should use for $N$ in our formulation. As
an explicit example, the number from Amazon.co.jp search results was 2,587,571 on Oct. 4, 2007,
while our fits indicate $N$ to be strictly less than 1 million (see (14)).

Fig 1: A long time sequence of data from Amazon.co.jp. The solid curve is a theoretical fit.
Horizontal and vertical axes are the hours and ranking, respectively.

Now we turn to our results of observation. The plotted $n_d = 77$ points in Fig. 1 show the time
evolution of the ranking of a book we observed between the end of May, 2007 (at which point the
book was ordered for sale) and mid August, 2007 (at which point the book was bought again). The
solid curve is a least square fit of these points to (13). The best mean-square fit for the
parameter set $(N, a, b)$ is:

$$(N^*, a^*, b^*) = (8.57 \times 10^5,\ 3.939 \times 10^{-4},\ 0.6312). \tag{14}$$

Note that $N^*$ is large, hence the fluctuations arising from randomness in the sales are relatively
suppressed ($O(1/\sqrt{N^*}) = O(10^{-3})$), as expected, while the number is smaller than that
found by performing a search at the Amazon website ($8.57 \times 10^5 < 2.6 \times 10^6$), so that
a fit of $N$ is necessary. $a^*$ is in units of [1/hour], and $1/a^*$ corresponds to 3.5 months,
which is longer than the interval of observation (2.5 months). Our method allows the determination
of time constants longer than the interval of observation because there is a large number of
(mostly unpopular) titles which theoretically allow a law-of-large-numbers mechanism. (The obtained
value of $a^*$ does not mean that there are no books at all which sell, say, only one copy a year on
average; it says that such books are far fewer than would be expected from a log-linear (Pareto)
distribution and have a negligible economic impact.)

The total variance $\chi^2$ of the data from this fit is $\chi^2 = 1.599 \times 10^{10}$, hence
the statistical fluctuation $\Delta y_C$ of the relative ranking is roughly of order

$$\Delta y_C = \frac{1}{N^*}\,\Delta x_C \approx \frac{1}{N^*} \sqrt{\chi^2 / n_d} = 0.02.$$

This seems a little larger than an expectation from the Gaussian fluctuation, which would be of
order $1/\sqrt{N} = 10^{-3}$. Fig. 1 suggests that a possible cause of the deviations of the data
from the fit is a small jump at about $t = 300$ hours. We suspect this to be a result of inventory
controls at the web bookstore, such as unregistering books out of print. Apparently, Amazon.co.jp
in the year 2007 was updating their catalogs manually and only occasionally, making it a kind of
unknown time-dependent external source for our analysis.
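
The orders of magnitude quoted above follow from the fitted values (14) by simple arithmetic, which can be reproduced as below. This is only a worked check; the variable names are hypothetical, and the conversion from hours to months assumes an average month length of 365.25/12 days.

```python
import math

# Fitted values from (14); a_star is in units of 1/hour.
N_star, a_star = 8.57e5, 3.939e-4
chi2, n_d = 1.599e10, 77                 # total variance and number of data points

hours_per_month = 24 * 365.25 / 12       # assumption: average month length
time_constant_months = 1.0 / a_star / hours_per_month   # 1/a* expressed in months
gaussian_scale = 1.0 / math.sqrt(N_star)                 # expected Gaussian fluctuation
delta_y = math.sqrt(chi2 / n_d) / N_star                 # observed fluctuation of y_C
```

The observed fluctuation comes out at the order of $10^{-2}$, roughly an order of magnitude above the Gaussian scale, consistent with the discussion in the text.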



Fig 2: Two long time sequences of data from Amazon.co.jp. One sequence with 77 points is the
data in Fig. 1; the other has 27 points. The solid curve is a theoretical fit to the 77 + 27 data.
Horizontal and vertical axes are the hours and ranking, respectively.

Concerning the stability of the parameters, we made another series of observations between
November 2007 and March 2008. This time, having less time to spare, we recorded only once a
week, resulting in $n_d = 27$ points. The solid curve in Fig. 2 is a least-squares fit of the combined 27
points and the 77 points in Fig. 1 to (13). The best mean-square fit for the parameter set (N, a, b)
is:

$$(N^*, a^*, b^*) = (8.00 \times 10^{5},\ 5.803 \times 10^{-4},\ 0.7959). \tag{15}$$

$\chi^2 = 2.0111 \times 10^{10}$ ($\Delta y_C \simeq 0.02$) effectively remained the same as in (14). The parameters have changed
somewhat: the change in the total number of active books $N^*$ is not large (about 7%), while $1/a^* = 2.4$
months is somewhat shorter than in (14). The exponent $b^*$ is larger, but note that we again
have an exponent b strictly less than 1.
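
As a rough arithmetic check of the numbers quoted for the second fit, the following sketch recomputes the time constant $1/a^*$ and the fluctuation estimate from the values in (15). It assumes that the combined fit uses $n_d = 77 + 27 = 104$ data points and that a month is 30 days; both are readings of the text rather than values stated explicitly, and the variable names are illustrative.

```python
import math

# Values quoted in (15) and in the sentence following it.
N_star = 8.00e5        # total number of active titles
a_star = 5.803e-4      # lowest sales rate, in 1/hour
chi2 = 2.0111e10       # total variance of the combined fit

# Assumptions (not stated explicitly in the text):
n_d = 77 + 27          # number of data points in the combined fit
HOURS_PER_MONTH = 24 * 30

# Time constant 1/a* converted from hours to months.
time_constant_months = 1.0 / a_star / HOURS_PER_MONTH

# Fluctuation of the relative ranking: (1/N) * sqrt(chi^2 / n_d).
delta_y = math.sqrt(chi2 / n_d) / N_star

print(f"1/a* = {time_constant_months:.1f} months")
print(f"Delta y_C = {delta_y:.3f}")
```

Under these assumptions the time constant comes out at about 2.4 months and the fluctuation slightly below 0.02, in line with the rounded values quoted above.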

Though we clearly and consistently have b < 1 (also seen from the concave curves in Fig. 1 and
Fig. 2), its value has changed. The change of $N^*$ and $a^*$ between (14) and (15) is consistent with
the hypothesis that Amazon.co.jp performed inventory controls (as they should) and removed
books with low sales between the two series of observations, so one explanation is that the exponent
b also changed. Another possible reason is that the new once-per-week data are too sparse and
that we need finer data for stable fits. In fact, as pointed out at the end of Section 3.2, a fit to the
distribution may be sensitive to small changes in the ranking data, and data finer than once per
week may be required. This problem could be overcome by automated data acquisition through
computer programming.

The values $b^* = 0.6312$ in (14) and $b^* = 0.7959$ in (15) are both less than 1. The result $b^* < 1$
obtained from our data can also be made plausible by a look at Fig. 1 and Fig. 2, because, as we
noted below (12), the short time behavior of the ranking is proportional to $t^b$ for b < 1 (which
implies that the graph is tangential to the ranking axis), while it is linear for b > 1. Previous studies
[Chevalier et al., 2003; Brynjolfsson et al., 2003] adopt values b > 1. (The correspondences of the
notations are $b = -1/\beta_2$ for [Brynjolfsson et al., 2003] and $b = \theta$ for [Chevalier et al., 2003]. In
statistics textbooks $b = \alpha$ and $a = 1/\beta$ are also used.) According to what we remarked below (8),
this implies that, in general, the economic impact of keeping unpopular titles at online bookstores
may be overestimated in the previous studies. We will continue on this point in Section 4.



4 Discussions. 



4.1 Formulas for the long tail structure of online retails. 

In Section 3 we dealt with an application of formula (2) in a practical situation, namely a prediction of
the time evolution of the ranking of a book. The theoretical framework in Section 2, which introduces
the main results of [K.&T. Hattori, 2008a], contains more than this, and predicts the total amount
of sales (per unit time) that could be expected from the items (e.g., books, in the case of an online
bookstore) on the tail side of any given ranking number $m \le N$.

Note that this is not equal to the total contribution to the sales from the tail side aligned
in order of potential (average) sales rate, which is $\sum_{i=m}^{N} w_i$ in the notations in Section 3. This is
because, since the ranking number jumps to the head each time the item sells at a random time,
and since there are a very large number of items ($N \gg 1$), we always have some lucky items with
low potential sales around the head side of the rankings, and by a similar argument, we
also must have some 'hit' items towards the tail side. The main theorem in [K.&T. Hattori, 2008a],
as explained in Section 2, states that the ratio of such (un-)lucky items having ranking numbers
very different from those expected from their potential sales ability $w_i$ is non-negligible even in the
$N \to \infty$ limit.
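
The presence of 'lucky' items near the head of the ranking can be illustrated by a small simulation. The sketch below is only an illustration under stated assumptions: each item sells as an independent Poisson process with the Pareto rate $w_i = a (N/i)^{1/b}$, the initial ranking is the order of potential sales, and the parameter values and the function name `simulate_ranking` are hypothetical. It uses the fact that, since an item jumps to the head at each sale, the ranking at time T is the order of the most recent sale.

```python
import random

def simulate_ranking(N=5000, a=5.8e-4, b=0.8, T=2000.0, seed=0):
    """Return item indices ordered by ranking (head first) after T hours.

    Item i (1-based) sells at Poisson rate w_i = a * (N / i) ** (1 / b),
    so smaller i means higher potential sales. Items sold within [0, T]
    are ordered by most recent sale; never-sold items follow in their
    initial order (assumed here to be the order of potential sales).
    """
    rng = random.Random(seed)
    sold, unsold = [], []
    for i in range(1, N + 1):
        w = a * (N / i) ** (1.0 / b)
        # Time elapsed since the last sale, looking back from time T.
        elapsed = rng.expovariate(w)
        if elapsed < T:
            sold.append((elapsed, i))
        else:
            unsold.append(i)
    sold.sort()  # most recent sale first
    return [i for _, i in sold] + unsold

N, r = 5000, 0.1
ranking = simulate_ranking(N=N)
head = ranking[: int(r * N)]
# "Lucky" items: in the top 10% of the ranking, but not in the top 10% by sales rate.
lucky = [i for i in head if i > r * N]
lucky_fraction = len(lucky) / len(head)
print(f"lucky items in the top {r:.0%} of the ranking: {lucky_fraction:.1%}")
```

The printed fraction depends on the chosen parameters and seed; the point is only that it is neither 0 nor 1, so the head of the ranking always mixes true hits with low-rate items that happened to sell recently.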

An explicit formula can be derived from (3). Note that (2) and Assumption (2) for Theorem 2
imply $\lim_{t\to\infty} y_C(t) = 1$, hence after a sufficiently long time since the start of the bookstore and its
ranking system, one may assume that the ranking reaches a stationary phase and the first equation
in (3) holds for all $0 \le y < 1$. Letting a = w and b = w + dw in (3) we have

$$\int_{z\in[0,y]} \mu_{z,t}(dw)\,dz = \left(1 - e^{-w\,t_0(y)}\right)\lambda(dw). \tag{16}$$

Let $0 \le r_1 < r_2 \le 1$, and denote by $S(r_1, r_2)$ the contribution to the total average sales per unit
time from the items with ranking number between $r_1 N$ and $r_2 N$. For a very large N, we may let
$N \to \infty$ and use (16) to find

$$\begin{aligned}
\lim_{N\to\infty}\frac{1}{N}\,S(r_1,r_2)
 &= \int_{(w,z)\in[0,\infty)\times[r_1,r_2]} w\,\mu_{z,t}(dw)\,dz \\
 &= \int_{(w,z)\in[0,\infty)\times[0,r_2]} w\,\mu_{z,t}(dw)\,dz
  - \int_{(w,z)\in[0,\infty)\times[0,r_1]} w\,\mu_{z,t}(dw)\,dz.
\end{aligned} \tag{17}$$



This is valid for an arbitrary sales rate distribution $\lambda$; for the Pareto distribution (6) we have, using
the incomplete Gamma function as in (9),

$$\lim_{N\to\infty}\frac{1}{N}\,S(r_1,r_2) = ab\left(\Gamma(1-b,\,q(r_1))\,q(r_1)^{b-1} - \Gamma(1-b,\,q(r_2))\,q(r_2)^{b-1}\right), \tag{18}$$

where $q(r) = a\,t_0(r)$ is given by (1) with (11):

$$r = 1 - e^{-q(r)} + q(r)^{b}\,\Gamma(1-b,\,q(r)). \tag{19}$$

For 1 < b < 2, a better expression using (10) as in (12) would be

$$\lim_{N\to\infty}\frac{1}{N}\,S(r_1,r_2) = \frac{ab}{b-1}\left(e^{-q(r_1)} - \Gamma(2-b,\,q(r_1))\,q(r_1)^{b-1} - e^{-q(r_2)} + \Gamma(2-b,\,q(r_2))\,q(r_2)^{b-1}\right), \tag{20}$$

with

$$r = 1 - e^{-q(r)}\left(1 - \frac{q(r)}{b-1}\right) - \frac{q(r)^{b}}{b-1}\,\Gamma(2-b,\,q(r)). \tag{21}$$
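
For readers who want the intermediate step, the following sketch indicates how the Pareto expressions follow from (16) and (17). It assumes the Pareto density $d\lambda/dw = b\,a^{b}\,w^{-b-1}$ for $w \ge a$, which is our reading of (6), and the standard recursion $\Gamma(z+1, p) = z\,\Gamma(z, p) + p^{z} e^{-p}$ for the incomplete Gamma function, which we take to be the content of (10).

```latex
\begin{align*}
\lim_{N\to\infty}\frac{1}{N}\,S(r_1,r_2)
 &= \int_a^\infty w\left(e^{-w\,t_0(r_1)} - e^{-w\,t_0(r_2)}\right)
    \frac{b\,a^{b}}{w^{b+1}}\,dw, \\
\int_a^\infty w\,e^{-wt}\,\frac{b\,a^{b}}{w^{b+1}}\,dw
 &= b\,a^{b}\,t^{\,b-1}\int_{at}^\infty s^{-b}\,e^{-s}\,ds
  = ab\,(at)^{b-1}\,\Gamma(1-b,\,at), \\
\Gamma(1-b,\,q)\,q^{\,b-1}
 &= \frac{1}{b-1}\left(e^{-q} - \Gamma(2-b,\,q)\,q^{\,b-1}\right).
\end{align*}
```

Setting $q = a\,t_0(r)$ in the second line gives each term of the expression in terms of $\Gamma(1-b,\cdot)$, and the third line (the recursion with $z = 1-b$) converts it into the form with $\Gamma(2-b,\cdot)$ used for 1 < b < 2.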

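$S(r_1, r_2)$ is to be compared with the contribution $\tilde{S}(r_1, r_2)$ to the total average sales per unit
time from the items i between $r_1 N$ and $r_2 N$ ordered in decreasing order of potential sales rate $w_i$,
as in (7). We have,

$$\lim_{N\to\infty}\frac{1}{N}\,\tilde{S}(r_1,r_2)
 = \lim_{N\to\infty}\frac{1}{N}\sum_{i=r_1 N}^{r_2 N} w_i
 = \lim_{N\to\infty}\frac{1}{N}\sum_{i=r_1 N}^{r_2 N} a\left(\frac{N}{i}\right)^{1/b}
 = a\int_{r_1}^{r_2} x^{-1/b}\,dx
 = \frac{ab}{b-1}\left(r_2^{(b-1)/b} - r_1^{(b-1)/b}\right). \tag{22}$$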



Note that q(0) = 0 and q(1) = $\infty$. The latter is from (9):

$$r = 1 - b\int_1^\infty e^{-q(r)\,y}\,y^{-b-1}\,dy.$$

The last term is a convergent integral for b > 0, which is proved by (19) for 0 < b < 1 and by (21)
for 1 < b < 2. It converges to 0 as $q(r) \to \infty$.






The special case of $r_2 = 1$ corresponds to the contribution from the tail side in the ranking for
$S(r, 1)$ and the tail side in the potential sales rate for $\tilde{S}(r, 1)$ (the 'long tail'), which are (after some
elementary calculus as above)

$$\lim_{N\to\infty}\frac{1}{N}\,S(r,1) = ab\,\Gamma(1-b,\,q(r))\,q(r)^{b-1}
 = \frac{ab}{b-1}\left(e^{-q(r)} - \Gamma(2-b,\,q(r))\,q(r)^{b-1}\right), \tag{23}$$

with q(r) given by (19) or (21), and

$$\lim_{N\to\infty}\frac{1}{N}\,\tilde{S}(r,1) = \frac{ab}{b-1}\left(1 - r^{(b-1)/b}\right). \tag{24}$$

Concerning the contributions from the head side ('great hits'), we note that the cases b > 1 and
b < 1 are different. This is easy to see in (22), where we find
$\lim_{r_1\to +0}\lim_{N\to\infty}\frac{1}{N}\tilde{S}(r_1, r_2) = \infty$ if b < 1,
while for b > 1, we can safely take the $r_1 \to 0$ limit to find

$$\lim_{N\to\infty}\frac{1}{N}\,\tilde{S}(0,r) = \frac{ab}{b-1}\,r^{(b-1)/b}.$$

This quantity represents an average sales rate per unit time per unit item, which is finite in
realistic situations. For b < 1 great hits dominate the total sales, which theoretically becomes
infinitely large as $N \to \infty$ (see (7)), while for b > 1 all the items contribute non-trivially, and
with a large number of items the contribution from the 'long tail' would dominate, which
intuitively explains the difference in the behavior. The divergence is a result of the $N \to \infty$ limit. We
will consider the cases b > 1 and b < 1 separately and discuss the implication of the value of b in detail.

4.2 Implications of the Pareto exponent b. 

We noted at the end of Section 4.1 and also below (8) that large b means that the 'long tail' is
important while small b means that great hits dominate. Intuitively, there are O(1) great hits
and O(N) long tail items, so the ratio of the contribution of the former to the latter is, using (8),
$O\!\left(\frac{w_1}{N \times w_N}\right) = N^{1/b - 1}$; hence when the total number of items N is large, the dominant contribution
to the total sales changes between b > 1 and b < 1.

4.2.1 Case b > 1: The long tail economy.

Let b > 1 and assume N is large.

For $0 \le r \le 1$, the contribution to the total sales per unit time of the N(1 - r) items (out of
the total N) with low sales potentials is given by (24):

$$\tilde{S}(r,1) \simeq \frac{Nab}{b-1}\left(1 - r^{(b-1)/b}\right). \tag{25}$$

In particular, the total sales per unit time at the online store is

$$S_{tot} = \tilde{S}(0,1) \simeq \frac{Nab}{b-1}. \tag{26}$$

Subtraction gives us the total sales amount from the Nr top hits per unit time:

$$\tilde{S}(0,r) \simeq \frac{Nab}{b-1}\,r^{(b-1)/b}. \tag{27}$$

Similarly, (23) gives the contribution to the total sales per unit time from the N(1 - r) items
in the tail side of the ranking:

$$\begin{aligned}
S(r,1) &\simeq Nab\,\Gamma(1-b,\,q(r))\,q(r)^{b-1}
 = \frac{Nab}{b-1}\left(e^{-q(r)} - \Gamma(2-b,\,q(r))\,q(r)^{b-1}\right); \\
r &= 1 - e^{-q(r)}\left(1 - \frac{q(r)}{b-1}\right) - \frac{q(r)^{b}}{b-1}\,\Gamma(2-b,\,q(r)).
\end{aligned} \tag{28}$$

In particular, noting q(0) = 0 and

$$\Gamma(1-b,\,q)\,q^{b-1} = \int_1^\infty e^{-qy}\,y^{-b}\,dy \;\to\; \int_1^\infty y^{-b}\,dy = \frac{1}{b-1}, \qquad q \to 0,$$

we have $S(0,1) \simeq \frac{Nab}{b-1}$ for b > 1, which is equal to (26) as expected, because all the items in the
store are listed on the ranking. Subtraction gives us the total sales amount from the top Nr items
in the ranking (at any given time, if the ranking is stationary) per unit time:

$$S(0,r) \simeq Nab\left(\frac{1}{b-1} - \Gamma(1-b,\,q(r))\,q(r)^{b-1}\right)
 = \frac{Nab}{b-1}\left(1 - e^{-q(r)} + \Gamma(2-b,\,q(r))\,q(r)^{b-1}\right). \tag{29}$$

A large b implies that there is a good chance in the long tail business. For example, for
an extreme case of b = 2, (27) implies $\tilde{S}(0, 0.2)/\tilde{S}(0, 1) \simeq \sqrt{0.2} \simeq 0.447$, so that the top 20% of hit
items contribute only 45% of total sales, far less than 80%, challenging the widespread '20-80
law'. This is, however, too extreme, and we should use realistic values. Concerning the analysis
based on the rankings of Amazon.com, Chevalier and Goolsbee [Chevalier et al., 2003] explored a
number of sources of information, including their own experiment, and obtained values for the
exponent b ranging from 0.9 to 1.3, and adopted the value b = 1.2 for their subsequent calculations,
to find, for example, that the online bookstores have more price elasticity than the brick-and-mortar
bookstores and have a significant effect on the consumer price index. Brynjolfsson, Hu,
and Smith [Brynjolfsson et al., 2003] use b = 1.15 ($-1/b = \beta_2 = -0.871$ in their notations) to
evaluate the increase in consumer welfare by the introduction of large catalogues of books by the
online bookstores. They also quote the values in [Chevalier et al., 2003] and report the result of a
similar experiment to obtain b = 1.09. For b = 1.2 and b = 1.15 we have $\tilde{S}(0.2, 1)/S_{tot} \simeq 0.235$
and $\tilde{S}(0.2, 1)/S_{tot} \simeq 0.189$, respectively, behaving more or less like the '20-80 law'. Of course, we are
considering N of the order of a million (or more, with the advance in the web 2.0 technologies and online
retails expected in the near future) distinct items as in (14) or (15), and the top 20% also means a
large number. The term 'possibility of the long tail business' makes sense for b > 1, in the sense
that, with a drastic decrease in the cost of handling a large inventory through online technology,
a retail store with a million items on a single list produces a large profit.
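
The shares quoted in this paragraph follow directly from (25), (26) and (27), since the common factor Nab/(b - 1) cancels. A minimal sketch of the arithmetic (the function names are illustrative):

```python
def head_share(r, b):
    """Share of total sales from the top r*N items by potential sales rate.

    From (26) and (27): S~(0, r) / S_tot = r ** ((b - 1) / b), for b > 1.
    """
    if b <= 1:
        raise ValueError("the N -> infinity share is only finite for b > 1")
    return r ** ((b - 1.0) / b)

def tail_share(r, b):
    """Share of total sales from the remaining (1 - r)*N items, from (25) and (26)."""
    return 1.0 - head_share(r, b)

for b in (2.0, 1.2, 1.15):
    print(f"b = {b}: top 20% -> {head_share(0.2, b):.3f}, "
          f"other 80% -> {tail_share(0.2, b):.3f}")
```

For b = 2 the top 20% of items carry about 45% of sales, while for b = 1.2 and b = 1.15 the remaining 80% of items carry roughly 24% and 19%, respectively.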

Let us return to (28) and consider the role of the stochastic ranking process in inventory controls.
As an example, consider a situation where an online store is to open a new brick-and-mortar store
with rN items out of the N items sold at the online store. If the manager knew the average sales rate $w_i$
of each item i = 1, ..., N (for example, based on past records at the online store), he would choose
the top rN items, and the expected decrease in the total sales (per unit time) compared to the
online store would be $\tilde{S}(r, 1)$. ($w_i$ will usually be estimated based on past records of sales, and there
is a potential problem, as expressed in the Introduction, that for items with small $w_i$, one would
have small sales records, and statistical fluctuations obscure precise determination of $w_i$ in the long
tail regime. How managers find a way out in this approach is beyond the scope of this paper.)
Now if the manager considered it quicker to select the top rN items in the ranking at the online store,
what would be the extra loss? In this case, the expected decrease in the total sales (per unit time)
will be $S(r, 1)$, so the ratio $S(r, 1)/\tilde{S}(r, 1)$ measures the extra loss from the use of the ranking number
in place of the sales rate. Fig. 3 shows this ratio as a function of r for $0.1 \le r \le 0.9$, calculated using
(28). As values of b we adopted those from [Chevalier et al., 2003; Brynjolfsson et al., 2003].
The ratio turned out to be insensitive to r in this range and shows a 35% to 40% increase. (For r
near 0 and 1, the ratio approaches 1, and the use of ranking data is better. For large b the ratio
also approaches 1, and we also found that the ratio is not sensitive up to b close to 1.) This shows
an example of the use of ranking data as a simple and effective means of analyzing the sales structure
of the long tails.

Fig 3: Ratio of contribution to the total sales from lower N(1 - r) items in the ranking to that from
lower N(1 - r) items in the sales potential. The upper and the lower curves correspond to b = 1.15
and b = 1.2, respectively. The horizontal and vertical axes are r and $S(r, 1)/\tilde{S}(r, 1)$, respectively.
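
The ratio discussed here can be evaluated numerically from (25) and (28). The sketch below is one possible way to do so and is not the procedure used to produce the figure: it rewrites $\Gamma(1-b, q)\,q^{b-1}$ as $\int_1^\infty e^{-qy} y^{-b}\,dy$, evaluates the integral by Simpson's rule after the substitution $y = e^{s}$, and inverts the relation between r and q by bisection. The function names `E`, `rank_of`, `q_of` and `loss_ratio`, the truncation point and the step counts are illustrative choices.

```python
import math

def E(b, q, n=1000):
    """Integral of exp(-q*y) * y**(-b) over y in [1, inf), i.e. q**(b-1) * Gamma(1-b, q).

    Uses the substitution y = exp(s) and Simpson's rule (n must be even).
    The range is truncated where exp(-q*(y-1)) has dropped to exp(-50).
    """
    s_max = math.log(1.0 + 50.0 / q)
    h = s_max / n
    f = lambda s: math.exp(-q * math.exp(s) + (1.0 - b) * s)
    total = f(0.0) + f(s_max)
    for k in range(1, n):
        total += (4 if k % 2 else 2) * f(k * h)
    return total * h / 3.0

def rank_of(q, b):
    """Relative ranking r for q = a * t_0(r): r = 1 - b * q**b * Gamma(-b, q)."""
    return 1.0 - b * E(b + 1.0, q)

def q_of(r, b):
    """Invert rank_of by bisection in log q (rank_of increases with q)."""
    lo, hi = math.log(1e-12), math.log(1e3)
    for _ in range(60):
        mid = 0.5 * (lo + hi)
        if rank_of(math.exp(mid), b) < r:
            lo = mid
        else:
            hi = mid
    return math.exp(0.5 * (lo + hi))

def loss_ratio(r, b):
    """S(r,1) / S~(r,1); the common factor N*a*b cancels between (28) and (25)."""
    q = q_of(r, b)
    return (b - 1.0) * E(b, q) / (1.0 - r ** ((b - 1.0) / b))

for b in (1.15, 1.2):
    print(b, [round(loss_ratio(r, b), 3) for r in (0.1, 0.5, 0.9)])
```

Because only the ratio is computed, the parameters N and a drop out and the result depends on r and b alone.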



4.2.2 Case b < 1: The great hits economy. 

Now let b < 1 and assume N is large. 

As noted at the end of Section 4.1, when we are considering sales for b < 1, taking the $N \to \infty$
limit results in unrealistic infinities in average sales (sales per item), arising from the divergence of
great hits. Explicitly, from (7) we have $w_i \to \infty$ as $N \to \infty$ for each fixed i. Divergence from a
single item does not cause the divergence of the average, but for b < 1, there are many such items
which affect averages.

Before studying this problem, we note that the time evolution of the ranking of a single item,
which we discussed in detail in Section 3, has no problem. Theoretically, this reflects the fact that
we assume nothing on the distribution $\lambda$ in Proposition 1. The problem of divergence of the average
sales rate is theoretically reflected only in the fact that for b < 1 Assumption (3) of Theorem 2
fails. As remarked below Theorem 2, this affects the distribution at y = 0, the top end of the
rankings, but no theoretical problem occurs for y > 0. Intuitively speaking, if there are (fictitious)
book titles which sell 'infinitely many copies per unit time', they keep staying at the top end of
the ranking, and the rest of the 'realistic' book titles follow the evolution of ranking as predicted by
Proposition 1. Also, the contribution to the total sales from the tail side (both $S(r, 1)$ and $\tilde{S}(r, 1)$
for r > 0) has no problem of divergence, i.e., it is asymptotically proportional to N as in (25) or (28).
In other words, formulas not containing contributions from the 'greatest hits' remain valid: For
$0 < r \le 1$, the contribution to the total sales per unit time from the N(1 - r) items (out of total
N) of low sales potentials is as in (25), $\tilde{S}(r, 1) \simeq \frac{Nab}{b-1}\left(1 - r^{(b-1)/b}\right)$, and that from the N(1 - r) items
in the tail side of the ranking is as in (28) with (19):

$$S(r,1) \simeq Nab\,\Gamma(1-b,\,q(r))\,q(r)^{b-1}; \qquad r = 1 - e^{-q(r)} + q(r)^{b}\,\Gamma(1-b,\,q(r)).$$



In particular, we can perform an analysis similar to that concerning Fig. 3 using (28). The loss in
total sales per unit time caused by selecting the top rN items in the ranking instead of selecting the top
rN items in the sales rate can be measured in terms of the ratio $S(r, 1)/\tilde{S}(r, 1)$. Fig. 4 shows this
ratio as a function of r for $0.01 \le r \le 0.9$, calculated using (25). As values of b we adopted the
values in (14) and (15). The ratio is below 1.6 and insensitive to r in this range. For r near 1, the
ratio approaches 1, and the use of ranking data is good. (Unlike the case b > 1 in Section 4.2.1,
the ratio remains strictly greater than 1 as $r \to 0$.)

Fig 4: Ratio of contribution to total sales from lower N(1 - r) items in the ranking to that from
lower N(1 - r) items in the sales potential. The upper and the lower curves correspond to b = 0.6312
and b = 0.7959, respectively. The horizontal and vertical axes are r and $S(r, 1)/\tilde{S}(r, 1)$, respectively.
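
The statement that the ratio stays above 1 as r tends to 0 can be made concrete by a short calculation. For b < 1 and small q, the relation between r and q gives $r \approx q^{b}\,\Gamma(1-b)$, while the ranking-side tail contribution behaves as $Nab\,q^{b-1}\,\Gamma(1-b)$ and the rate-side one as $\frac{Nab}{1-b}\,r^{-(1-b)/b}$, so their ratio tends to $(1-b)\,\Gamma(1-b)^{1/b}$. This limit is a back-of-the-envelope consequence of the formulas above under the small-q approximation, offered as a sketch rather than a proved result; the function name is illustrative.

```python
import math

def small_r_limit(b):
    """Limit of S(r,1) / S~(r,1) as r -> 0 for 0 < b < 1 (see the lead-in)."""
    if not 0.0 < b < 1.0:
        raise ValueError("this limit is derived for 0 < b < 1 only")
    return (1.0 - b) * math.gamma(1.0 - b) ** (1.0 / b)

for b in (0.6312, 0.7959):
    print(f"b = {b}: ratio -> {small_r_limit(b):.3f} as r -> 0")
```

For both fitted exponents the limit lies between 1 and 1.6, and it is larger for the smaller exponent, consistent with the ordering of the two curves.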

Returning to the problem of unrealistic infinity, a simple modification of our approach would
be to introduce a cut-off. Taking logarithms of (7) we have

$$\log w_i = -\frac{1}{b}\log i + \frac{1}{b}\log N + \log a, \qquad i = 1, 2, \ldots, N. \tag{30}$$

This formula shows that if we plot the sales rates $w_i$ against i on a log-log graph, the points will
fall on a single line. (This suggests a reason why the Pareto distribution is also called the log-linear
distribution and why the exponent -1/b is called the Pareto slope parameter.) When one assumes
Pareto distributions in social and economic studies, the argument would be in the reverse direction;
one probably first observes data aligned close to a single line on a log-log graph, and then arrives
at an idealized theoretical model (30) or (7). The line actually ends in realistic situations, and (30)
denotes the tail end by $w_N = a$ and the head end by $w_1 = aN^{1/b}$. We let $N \to \infty$ in our formulation
and as a result lost the head end, which causes trouble in the average sales rate for b < 1. A simple
remedy is therefore to introduce a cut-off parameter $\gamma > 0$ or $n_0 = \gamma N$, and assume a modified
Pareto distribution,

$$\log w_i = \log a - \frac{1}{b}\log\frac{i + n_0}{N + n_0}, \qquad i = 1, 2, \ldots, N, \tag{31}$$

or extend (7) as

$$w_i = a\left(\frac{N + n_0}{i + n_0}\right)^{1/b}, \qquad i = 1, 2, 3, \ldots \tag{32}$$

$\gamma = 0$ or $n_0 = 0$ is the original Pareto distribution. We assume the Pareto distribution to be basically
applicable, so we assume $\gamma \ll 1$ ($1 \ll n_0 \ll N$).
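
The effect of the cut-off on the log-log plot can be seen in a few lines. The sketch below evaluates the modified sales rate with $n_0 = \gamma N$ and compares log-log slopes near the tail and near the head; the parameter values and function names are hypothetical and chosen only for illustration.

```python
import math

def pareto_rate(i, N, a, b, gamma=0.0):
    """Sales rate w_i = a * ((N + n0) / (i + n0)) ** (1/b) with n0 = gamma * N.

    gamma = 0 gives the original Pareto form w_i = a * (N / i) ** (1/b).
    """
    n0 = gamma * N
    return a * ((N + n0) / (i + n0)) ** (1.0 / b)

# Illustrative (hypothetical) parameter values.
N, a, b, gamma = 10**6, 5.8e-4, 0.8, 1e-3

def loglog_slope(i1, i2, gamma):
    """Slope of log w_i against log i between the items i1 and i2."""
    w1 = pareto_rate(i1, N, a, b, gamma)
    w2 = pareto_rate(i2, N, a, b, gamma)
    return (math.log(w2) - math.log(w1)) / (math.log(i2) - math.log(i1))

print("tail slope, no cut-off  :", loglog_slope(N // 10, N, 0.0))
print("tail slope, with cut-off:", loglog_slope(N // 10, N, gamma))
print("head slope, with cut-off:", loglog_slope(1, 10, gamma))
print("w_1 without / with cut-off:",
      pareto_rate(1, N, a, b), pareto_rate(1, N, a, b, gamma))
```

On the tail side the slope stays close to -1/b, so the cut-off is hardly visible there, while near the head the curve flattens and the largest sales rate remains bounded.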
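Using (32) in the left hand side of (22), we have

$$\lim_{N\to\infty}\frac{1}{N}\,\tilde{S}(r_1,r_2) = \frac{ab}{1-b}\,(1+\gamma)\left(\left(\frac{1+\gamma}{r_1+\gamma}\right)^{(1-b)/b} - \left(\frac{1+\gamma}{r_2+\gamma}\right)^{(1-b)/b}\right). \tag{33}$$

If $\gamma = n_0/N = 0$ we reproduce (24). We can safely let $r_1 \to 0$ in (33) and find

$$\tilde{S}(0,r) \simeq \frac{Nab}{1-b}\,(1+\gamma)\left(\left(\frac{1+\gamma}{\gamma}\right)^{(1-b)/b} - \left(\frac{1+\gamma}{r+\gamma}\right)^{(1-b)/b}\right). \tag{34}$$

In particular,

$$S_{tot} = \tilde{S}(0,1) \simeq \frac{Nab}{1-b}\,(1+\gamma)\left(\left(1+\gamma^{-1}\right)^{(1-b)/b} - 1\right) \simeq \frac{Nab}{1-b}\,\gamma^{-(1-b)/b}. \tag{35}$$

(The last expression is obtained by taking the leading term in $\gamma \ll 1$.) Note that we cannot let $\gamma \to 0$
for $S_{tot}$.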
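
As a sanity check of the total sales formula with a cut-off, the sketch below compares the finite-N average $\frac{1}{N}\sum_i w_i$, with $w_i = a\left(\frac{N+n_0}{i+n_0}\right)^{1/b}$ and $n_0 = \gamma N$, against the closed form $\frac{ab}{1-b}(1+\gamma)\left((1+\gamma^{-1})^{(1-b)/b} - 1\right)$. The values of N, b and $\gamma$ are hypothetical, and a = 1 is used so that results are in units of a.

```python
def mean_sales_sum(N, a, b, gamma):
    """(1/N) * sum of w_i over i = 1..N, with the cut-off n0 = gamma * N."""
    n0 = gamma * N
    return sum(a * ((N + n0) / (i + n0)) ** (1.0 / b) for i in range(1, N + 1)) / N

def mean_sales_limit(a, b, gamma):
    """Closed form of S_tot / N in the large-N limit; assumes 0 < b < 1 and gamma > 0."""
    return (a * b / (1.0 - b) * (1.0 + gamma)
            * ((1.0 + 1.0 / gamma) ** ((1.0 - b) / b) - 1.0))

# Illustrative (hypothetical) values.
N, a, b = 100_000, 1.0, 0.8
for gamma in (1e-1, 1e-2, 1e-3):
    print(gamma, mean_sales_sum(N, a, b, gamma), mean_sales_limit(a, b, gamma))
```

The two columns agree closely, and both grow as $\gamma$ decreases, which illustrates why the total sales cannot be determined without knowing the cut-off when b < 1.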

Other quantities can also be derived if we replace (7) by (32). Following the argument below
(2), we have, in place of (6),

$$\frac{d\lambda}{dw}(w) = \begin{cases}
0, & w > aN^{1/b}\left(1+\gamma^{-1}\right)^{1/b}, \\[4pt]
\dfrac{b\,a^{b}\,(1+\gamma)}{w^{b+1}}, & a \le w \le aN^{1/b}\left(1+\gamma^{-1}\right)^{1/b}, \\[4pt]
0, & w < a.
\end{cases} \tag{36}$$

Substituting (36) in (5) we have, in place of (11),

$$y_C(t) = 1 - b\,(at)^{b}\,(1+\gamma)\,\Gamma(-b,\,at) + b\,(at)^{b}\,(1+\gamma)\,\Gamma\!\left(-b,\ at\,N^{1/b}\left(1+\gamma^{-1}\right)^{1/b}\right). \tag{37}$$

We note that we can take the $\gamma \to 0$ limit in (37) and reproduce (11). In other words, the effect of $\gamma$ is
small for the evolution of ranking $y_C(t)$, if $\gamma$ is small. In Section 3 we assumed the original Pareto
distribution, and performed a fit to (13), which is equal to (9). That this works implies that $\gamma$ is
actually small and that (9) is a good approximation to (37). In fact, as noted at the beginning
of this Section 4.2.2, the effect of 'greatest hits' on the ranking is that they constantly keep the
top positions. The ranking data at Amazon websites are updated only once per hour,
and since there are many books which sell more than one copy per hour, we never observe ranking 1 by
tracing (as we do) a book which sells only once per month. For such observations it is intuitively
clear that taking $N \to \infty$ causes no singularities regardless of the value of b.

Reversing this argument, we see that since a small difference in $\gamma$ does not affect the evolution
of ranking $y_C(t)$, we cannot estimate the value of $\gamma$ from $y_C(t)$. The dependence on $\gamma$ of the total
sales $S_{tot}$ in (35) cannot be removed, hence for b < 1, we cannot estimate the total sales of the
online store from the ranking data. Our method is effective in studying the tail structures, but is
weak at great hits for b < 1. Standard methods, such as estimating from press reports about top
hits, should be combined with it if the online store is not willing to disclose the total sales.

Returning to (35), we see that for b < 1 the total sales $S_{tot}$ could be very large (if the cut-off
parameter $\gamma$ is very small) while (25) implies that $\tilde{S}(r, 1)$, the contribution from the tail side, is
constant in $\gamma$, hence the ratio $\tilde{S}(r, 1)/S_{tot}$ could be very small. This is in contrast to the case
b > 1 discussed in Section 4.2.1, where the ratio is significantly away from 0. In this sense, the
contribution to the sales from the long tail would be modest in general, and the impact of long
tail business on the economy would also be modest, if b < 1. Our calculations for Amazon.co.jp in
Section 3 support b < 1, in spite of the Amazon group's reputation for their long tail business.
We are however aware that when we talk about the possibility of long tail business, there are
aspects other than the contribution to the total sales or the direct economic impact of long tails. For
example, the phrase 'the leading retail store' is a highly effective advertisement, and being number
one, it would be quoted by mass media, thereby drastically reducing advertisement costs. We therefore
will not be amazed if an online bookstore takes a strategy of advertising its long tail business
model, but is hesitant about disclosing its actual sales achievement, and makes profit largely from
advance orders of 'great hits' such as the Harry Potter series.



4.3 Conclusions. 

In this paper, we gave a mathematical framework of a new method to obtain the distribution
of sales rates of a very large number of items sold at an internet retail site which discloses sales
rankings of its items. We gave explicit formulas for practical applications and an example of a fit
to the actual data obtained from Amazon.co.jp. The method is based on new mathematical results
[K.&T. Hattori, 2008a; K.&T. Hattori, 2008b] on an infinite particle limit of the stochastic ranking
process, and is theoretically new and quantitatively accurate.

The method is especially suitable for quantitative studies of the long tail structure of online
retails, which have been expanding commercially with the advance in computer networks and web
technologies. The calculation algorithm of the ranking numbers is very simple (simplest is the best,
from the theoretical side), and will be relatively easy to implement online. Hence our theory
could serve as an efficient and inexpensive method for disclosure policies and regulation purposes,
as well as providing online store businesses with a method of prompt analysis of the long tail sales
structure. (We have heard from a book publisher that Amazon.co.jp is not willing to disclose its
sales results. The publisher was amazed to learn that we could estimate Amazon's sales structure
from their rankings.) With a possible future increase in online long tail business, the significance
of our theory for business disclosure policies may increase.

Since the method is based on mathematical results, it is in principle applicable to general situations
such as retail stores with POS systems, blog page view rankings, or the title listings of the web
pages in collected web bulletin boards. In fact, we collected preliminary data from 2ch.net,
one of the largest collected web bulletin boards in Japan, performed a fit to (13), and obtained a
value b = 0.6145 for the Pareto exponent, which is close to (14). See [K.&T. Hattori, 2008b] for
details. In the 2ch.net title listing page, the titles are ordered by the 'last written threads at the
top' principle, which matches the definition of the stochastic ranking process in Section 2.

The method would be useful for marketing purposes as well as for studies of social activities in
general; thus we consider it worthwhile to disclose the method for free use in practical situations.